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ABSTRACT 

Surveys of faint galaxies at high redshifts often result in a "pencil-beam" geometry 
that is much longer along the line-of-sight than across the sky. We explore the effects 
of this geometry on the abundance and clustering of Lyman-break galaxies (LBGs) 
and Lyman-a emitters (LAEs) in current and future surveys based on cosmological 
N-body simulations which adequately describe the nonlinear growth of structure on 
small scales and compare to linear theory. We find that the probability distribution 
of the LBG abundance is skewed toward low values since the narrow transverse di- 
mension of the survey is more likely to probe underdense regions. Over a range that 
spans 1-2 orders of magnitude in galaxy luminosities, the variance in the number of 
objects differs from the commonly used analytic prediction and is not dominated by 
Poisson noise. Additionally, nonlinear bias on small scales results in a one-dimensional 
power spectrum of LAEs using a James Webb Space Telescope field-of-view that is 
relatively flat, markedly different from the expectation of linear perturbation theory. 
We discuss how these results may affect attempts to measure the UV background at 
high redshifts, estimate the relationship between halo mass and galaxy luminosity, and 
probe reionization by measuring the power-modulating effect of ionized regions. 

Key words: cosmology: theory - cosmology: observations - early universe - large- 
scale structure of the universe - galaxies: high-redshift - galaxies: abundances 



1 INTRODUCTION 

Over the past few years, searches for Lyman-break galax- 
ies (LBGs) and Lyman-alpha emitters (LAE) have revealed 
new populations of young, star-forming objects at redshift 
beyond z = 6 (e.g. 



young, star-lormmg objects at rcdshilt 
Stark et all l2007al : iRichard et all 2008 ; 



iBouwens et alJl200St 120091 ) that shed light on the star for- 
mation history at high redshifts and the sources responsible 
for cosmic reionization. In addition, evidence has been found 
for a population of massive, evolved system s around z = 5 
l|Mobasher et al.ll2005l : IWiklind et ai1 l2008 ) that hints at an 
earlier period of higher star formation than has yet to be 
seen in surveys of bright LBGs and LAEs. 

Theoretically, it is expected that the bulk of the star 
formation during reionization had taken p lace in less lumi- 
nous galaxies than previously observed(e.g. lBarkana fc Loebl 
l200ll : IWvithe fc Loeb|[200tl . Dwarf galaxies are supposed 
to supply the bulk of the radiation that reionizes the 
Universe as well as provide the building blocks of the 



(e.g. Kravtsov & Gnedin 20051: 


Gnedin & Kravtsovl 


2006; 


iMoore et al. 20061: Madau et al.l 


2008: Munoz et al.l 12009). 



Attempts to probe the faint end of the luminosity function 
have often sacrificed field-of-view for the sake of higher flux 



sensitivity. Objects in the 11.2arcmin 2 Hubble Ultra Deep 
Field (HUDF), for example, can be seen in the zsso.ab pass- 
band down to an apparent magnitude of about 29. In an 
even more extreme case, a single long-slit spectroscopic sur- 
vey for gravitationally-lensed LAEs has a field-of-view of 
only about 32 arcsec 2 (which is even smaller in the unlensed 
source plane) but can achieve large boosts in sensitivity if 
positioned on the gravitational lensing critical line of a fore- 
ground galaxy cluster. The James Webb Space Telescope 
(JWSTq), planned for launch in 2013, will have a better 
sensitivity, enabling observers to probe "pencil beams" out 
to even higher redshifts. 

Recent data analysis from deep and narrow galaxy sur- 
veys has begun to be incorporated into theoretical mod- 
els of structure formation and reionization. Several stud- 
ies have attempted to relate the observed UV and Lyman- 
alpha luminosities from these sources to the masses of 
the dark matter halos that host them by fitting results 
from simulations and semi-analytic prescriptions to the 
observations and so underst and the clustering and abun- 
e.g. iLe Delliou et al.ll2005l. 120061: 
Kobavashi et al.l 120071 : iNagamine et al.l 



dance of such galaxies 
Stark et al.l l2007bl " 



20081 : lOrsi eTaT. 2008; Kobavashi et al. 2009). In addition, it 
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has been shown that cosmic reionization in the high-redshift 
Universe can be probe d by observing its effe c ts on the galaxy 
power spectrum (e.s. iBabich fc Loebll2006l : iMcQuinn et al.l 
l2007l ; IWvithe fc Loebll2007D . 

In this paper, we study the underlying abundance and 
clustering statistics of halos in narrow fields-of-view. Our 
goal is to relate the theory of structure formation to the large 
amount of observational data from high-redshift galaxy sur- 
veys. We take into account the effect of large-scale density 
fluctuations as well as the nonlinearities on small scales. We 
use state-of-the-art numerical simulations to examine the 
field-to-field sample distribution and one-dimensional power 
spectrum of "pencil-beam" galaxy surveys that are much 
longer along the line-of-sight than across the field-of-view. 
Our analysis is designed to help observers in the interpreta- 
tion of related observational data in the future. 

While, similar studies of the cosmic variance and count 
distribut ions in high-redshift dropout surveys were ca rried 
out fe.g. iTrenti fc Stiavellil 120081; lOverzier et al.ll2009l) , our 
analysis differs from Trenti fc StiavelhT i 20081 ) in that we use 
our simulated halo catalogs to create mock surveys viewed 
along the light-cone and probes more deeply than both 
studies since our simulations can resolve halos as small as 
~ 10 s M Q . 

Our paper is organized as follows. In Sj2]we describe our 
simulations. The probability distributions of object counts 
in high-redshift pencil surveys are analyzed in Sj3] In Sj4] we 
explore the one-dimensional power spectrum obtained from 
such surveys, the nonlinear bias, and the usefulness of such 
measures in relating galaxy luminosity and halo mass. Fi- 
nally, we discuss our results and their implications for ob- 
servations in Sj5] 



2 THE SIMULATION DATA 

To simulate observations of high-redshift galaxies, we 
adopt halo catalogs from an N-body simulation and semi- 
analytically dress each halo with a luminosity based on 
its mass. Thes e catalogs were p roduced by the simu- 
lation used in iTrac et all (|2008l l which resolved halos 
with masses as low as ~ 10 8 Mq , almost two orders-of- 
magnitude smaller than those resolved in simulations by 
ITrenti fc Stiaveilil (fioOS), in a box of comoving size 143 Mpc 
(assuming a Hubble constant of Hq = 70.km/s/Mpc). 
This approach allows us to probe the faintest high-redshift 
galaxies within a sufficiently large simulation box that 
includes the important large-scale fluctuations. A flat, 
ACDM cosmology was assumed with cosmological parame- 
ters from the Wilkin son Microwave Anisotropy Probe 5-yea r 
data r elease (|Komatsu et al.1 120091 ; iDunklev et"al] |2009h . 
(n m , n A , fit, h, a s , n a ) = (0.28, 0.72, 0.046, 0.70, 0.82, 0.96). 

We determined a UV or Lyman-a luminosity for each 
halo in the simul ation according to the prescription of 
IStark et al.1 l|2007bl ) using their simple assumption that each 
halo hosts no more than a single bright galaxy. Their model 
for star formation assumes a fraction, of the baryons in a 
halo with mean baryon fraction forms stars in an amount of 
time given by the cosmic age at the observed redshift times 
the duty cycle eoc- The efficiency is assumed to be constant 
with mass for halos with circular velocity above 100 kms -1 , 
however, star formation in smaller halos is suppressed so 
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Figure 1. The contributions to the total variance (Eq. [T] solid 
lines) in LBG dropout surveys as a sum of cosmic variance 
(dashed lines) and Poisson shot noise (dotted lines) contributions 
(i.e. the first and second terms, respectively, on the right-hand- 
side of Eq.[l]l. The top and bottom panels show results for surveys 
extending from z=6-8 and z=8-10, respectively. Thin lines assume 
a luminosity threshold of zgrM, AB=29, while for thick ones, the 
cut is at z 850 ,ab=27. 



that the efficiency is a power-law with mass to the 2/3 power. 
The resulting star formation rate is c onverted to a UV lu - 
minosity (for LBGs) according to the iMadau et al.l (|l998T l 
model or to a Lyman-a luminosity (for LAEs) by assuming 
a fixed fraction, T a , of the 4 x 10 53 ionizing photons emitted 
per star formation rate in Mq /yr (for a Salpeter IMF) escape 
the galaxy and the IGM. For a particular type of galaxy, lu- 
minosity L is associated with halo mass Mhaio so that the 
abu ndance of observed syste ms with luminosity L is equal to 
the ISheth fc Tormenl (|l999T ) abundance of halos with mass 
M times the duty cycle. We adopt the IStark et all (|2007bT ) 
best-fit values of (e DC , /*) = (0.25, 0.16) and (1.0, 0.063) for 
LBGs and LAEs, respectively. Since enc < 1 for LBGs, we 
appropriately consider only a fraction, eoc, of them to be 
observed in each snapshot of a galaxy survey. 

Finally, since we hope to present results that are not 
too model-dependent, we will continue to refer to both the 
galaxy luminosity and halo mass in the calculations that 
follow and in relevant figures. In this way, future models 
deemed more appropriate may be applied to our results for 
a given halo mass. 



3 PROBABILITY DISTRIBUTION OF 
GALAXY COUNTS 

In this section, we create mock samples of LBGs in pencil 
beams surveys at different redshifts. Since photometric red- 
shifts of LBGs are typically imprecise, we consider groups 
of these galaxies in the redshift intervals of z = 6-8 and 
z — 8-10. Many LBGs already have been observed in the 
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Figure 2. The upper panel shows relative contributions to the 
fractional variance in the number counts of galaxies as a func- 
tion of UV luminosity, z-band magnitude, or host halo mass in 
counts of LBGs in a dropout survey spanning the redshift inter- 
val z = 6-8 with a 3.4' X 3.4' field-of-view. Solid lines show the 
total variance, while long-dashed and dotted lines show the con- 
tributions from sample variance and Poisson noise, respectively. 
Red curves show simulation results, while black curves were cal- 
culated analytically. Vertical lines brackett the region where the 
variance is higher than expected due to the skewness of the full 
count probability distribution but is not Poisson-dominated. The 
lower panel shows the skewness of the full galaxy count probabil- 
ity distribution calculated from the simulation (Eq. 0). 



HUDF around z = 6 (e.g. iBouwens fc Illingworthl 120061 ; 
iBouwens et al.l l2006t ) , but observers have not yet had the 
same success at higher redshifts (e.g. IBouwens et al.l 120081 . 
2009). Given its higher sensitivity, it is hoped that JWST 
will be able to observe LBGs at these redshifts. 

To construct our samples of LBGs, we stitch together 
simulation slices in the given redshift range so that each sur- 
vey volume is viewed along the light-cone. This allows the 
mass function of halos to evolve along the line-of-sight, as 
needed to represent hig h-redshift surveys an d to correctly 
analyze their statistics |Munoz fe Loebll2008l ). For simplic- 
ity, we have assumed a top-hat selection function over each 
redshift interval. 

The time spacing between simulation snapshots is 
10 Myr. This means that there are 29 and 16 simulation 
slices between z = 6 — 8 and z = 8 — 10, respectively. The 
comoving sizes of these slices vary from 21.1 Mpc at z — 6 
to 27.3 Mpc at z = 8 and 32.6 Mpc at z = 10. We include 
in our z — 6 — 8 lightcone catalog halos from a slice 21 Mpc 
thick at one end of the simulation snapshot at z = 6, those 
from the adjacent slice 21 Mpc thick in the snapshot taken 
10 Myr earlier, and so on until we include the halos in a slice 
27 Mpc thick at z = 8. The z = 8 — 10 lightcone catalog is 
produced similarly. 

The comoving distances within the redshift intervasl 
z = 6 - 8 and z = 8 - 10 are about 700 and 500 Mpc, 



respectively, while the simulation box is only 143 Mpc on 
a side. In the process of stitching together slices, when we 
come to the end of the simulation box, we simply rotate 
the box and again stitching together slices through the box. 
We make the rotation by giving new coordinates (x[,x' 2 , x' 3 ) 
to each halo with previous position (xi,X2,X3) such that 
(x' 1 ,x' 2 ,x' 3 ) = (x2,xs,xi); we also make the additional ro- 
tation (x[, x' 2 , x 3 ) = (xi, X3, x 2 ) every third time we run 
through the simulation box so that there is no repetition 
in skewers along the line-of-sight. These rotations suppress 
power in the survey on the scale of the simulation length. 

For each redshift range, we calculate the field-to-field 
variance in the count of LBGs in a "pencil beam" of a given 
field-of-view using both linear theory and from the simu- 
lation data. The total variance in a survey is the sum of 
contributi ons from cosmic varia nce and Poisson shot noise 
(see, e.g.. lSomerville et aljlioolh : 



(TV 2 ) - (TV) 2 



„2 
°hh 



2 + (N) 



(TV) 2 (TV) 2 (TV) k 

where (TV) is the mean number of LBGs in each skewer. 
The cosmic (or sample) variance (the first term on the right 
hand side of Eq. [1} results from the field sometimes lying in 
a region of high density with a lot of structure due to the 
clustering of galaxies and sometime lying in an under-dense 
region or a void. This contribution can be calculated from 
linear perturbation theory (i.e. based on the linear power- 
spectrum, P(k)) as a function of the minimum halo mass, 
M, hosting observed galaxies as 



al h (M) 



(b eff (M)D(z)) 2 
(2tt) 3 



P(k) wl 



,d 3 fe, 



(2) 



where W xyz = W {k x ) W (k y ) W (k w ) , k = y/kJ+W+l^, 
D(z) is the linear growth factor evaluated, for simplic- 
ity, at the midpoin t of th e given redshift range, and b e // 
is the ISheth et al.l (|200ll ) linear bias in tegrated over all 
mass es above M and weighted by the (ISheth fc Tormen 
1999, hearafter ST) halo mass function IIMatarrese et al 



19971 ; iMunoz fc Loebll2008r ). The window function, W{k 
is the Fourier transform of a top-hat in the i-th dimension 
and is given by: 

sin(fci Oi/2) 



W{ki) = 



(3) 



ki Q>i 1 2 

In equation ((3]), a x and a y are the narrow dimensions of 
the skewer-shaped survey evaluated at z. The subscript w 
refers to coordinates along the line-of-sight. The length of 
the survey, aw, is given by the comoving distance between 
the redshift range of interest. 

To calculate the survey statistics, we take 9 = 3.4' (the 
field-of-view the HUDF and approximately of JWST) and 
count the number of LBGs in each of the 256 (225) pos- 
sible skewer positions perpendicular to the front face of 
our z — 6 — 8 [z = 8 — 10) lightcone simulation volume. 
While nearby skewers may be correlated, distant skewers 
are slightly anticorrelated. Taking each possible skewer ex- 
actly once is equivalent to choosing an infinite number of 
randomly located skewers since each skewer position will be 
sampled the same number of times on average. Moreover, 
given that we expect correlations on the size of the simula- 
tion box to be negligible, there is no difference between this 
statistical measurement and one with only a single skewer 
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Figure 3. Same as Figurc[2]but for the redshift interval z = 8-10. 



and multiple realizations of the simulation volume are pro- 
duced using different initial conditions. 

Figure [T] compares the contributions from cosmic and 
Poisson variance as calculated by linear theory (Eq. [1] and 
01 for function of the opening angle of the 

survey, 9 = a x /x{z). This diagram can be used to calculate 
the effectiveness of future surveys with large fields of view. 
Given the limiting size of our simulation volume, a sufficient 
number of skewers is not obtainable to perform comparable 
numerical calculations for large surveys. However, figures [2] 
and [3] show the agreement for the HUDF field-of-view. 

Figures [2] and [3] compare the contributions to the vari- 
ance in the simulations with calculations based on linear the- 
ory for the z = 6—8 and z = 8—10 ranges, respectively. They 
suggest that the observed statistics are well approximated 
by the analytic calculations at the low luminosity limits. 
However, while linear theory predicts a Gaussian probabil- 
ity distribution of the count of halos with variance given by 
equation [5] the simulated probability distribution of bright 
LBGs has a non-Gaussian shape. This can be clearly seen by 
eye in Figure [4] where the solid histogram representing the 
measured probability distribution is best represented by a 
log-normal distribution. The skewness as a function of mini- 
mum luminosity is presented in the bottom panels of figures 
[5] and |3] We define the skewness as the third moment of the 
probability distribution normalized by the variance to the 
3/2 power: 



S3 



((iV-(iV)) 3 ) 



3/2 



(4) 



The seemingly large amplitude variations in the skewness 
at low luminosity for z — 6-8 are due to small numeri- 
cal fluctuations around the nearly zero skewness from the 
simulation, plotted on a log scale. Deviations between the 
analytic and simulation values of the sample variance grow 
as the skewness becomes more significant. This behavior is 
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Figure 4. The full probability distribution for the number TV of 
LBGs in a dropout survey spanning the redshift interval z = 8- 
10 with a minimum galaxy luminosity of 10 28 ergs/s/Hz. The 
solid black curve shows the distribution as a histogram measured 
from the simulation, while the red dot-dashed, cyan long-dashed, 
and green short-dashed lines overlay Poisson, Gaussian, and log- 
normal distributions with the same mean, mean and standard 
deviation, and log-mean and log-standard deviation, respectively. 
The vertical, dotted line denotes the mean number of objects in 
the survey over many pointings. The distributions are plotted as 
differential probability in a differential abundance bin on the left, 
vertical axis as a function of the measured abundance of LBGs, 
N, on the lower, horizontal axis and as a function of the overden- 
sity in the abundance <5jv = N/ (N) — 1 on the upper, horizon- 
tal axis. Additionally, the counts of pointings per abundance bin 
for the simulation histogram is shown on the right, vertical axis. 
These counts per bin show that Poisson errors in the simulation 
histogram are smaller than the differences between the overlaid 
distributions. The simulation results match most closely with the 
skewed log-normal distribution. 



a manifestation of nonlinear clustering on the small scales 
probed by the narrowness of the skewer. 

Finally, the figures show sharp regimes where the field- 
to-field variance is dominated either by cosmic or Poisson 
variance. For the two redshift ranges z = 6-8 and z = 8- 
10 this division comes at around L (M = 5 x 1O 1O M0) = 

28 /Tjr „„ j r 1 -\jr r> i c\\Q ] 



6 
10 



x 10 zo ergs/s/Hz and L [M = 2 x 10 A/q) = 3 x 
28 ergs/s/Hz, respectively. The nonlinearity at higher lu- 
minosities causes the sample variance to dominate at masses 
which are ~ 50-100% larger than otherwise expected. How- 
ever, since the Poisson variance is unaffected by nonlinear- 
ities, there remains only a specific mass range (shown in 
the figures) over which the total variance is different than 
expected. 
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Figure 5. The ID power spectrum of halos hosting LAEs in 
a survey with a 3.4' X 3.4' field-of-view at Z = 6. The black 
(blue) lines represent values for a minimum Lyman-a luminos- 
ity of LLa.min = 10 40 (10 41 ) ergs/s corresponding to M^ a i = 
1.6 X 10 9 (6.4 X 10 9 ) Mq. In the upper panel, solid lines show the 
average log of the Poisson-subtractcd, dimcnsionlcss power spec- 
trum. Long-dashed lines indicate the estimated 1 — a variation 
in the log of the amplitude of the power spectrum from skewer- 
to-skewer. For clarity, these lines only appear for the lower lumi- 
nosity threshold, but they are about the same size for the higher 
luminosity one. Short-dashed, green and red lines show the lin- 
ear theory predictions for the dimensionless halo power spectrum 
given the lower and higher luminosity thresholds, respectively, 
while the dotted-dashed, magenta curve shows the linear theory 
calculations for the dark matter. The lower panel shows the non- 
linear bias (Eq. [Sj measured from the simulation (solid lines) and 
the linear, ST bias (short-dashed lines). 



4 LINE-OF-SIGHT (ONE-DIMENSIONAL) 
POWER SPECTRUM 

Next, we consider the one-dimensional (ID) galaxy power 
spectrum measured along the line-of-sight of a "pencil- 
beam" survey at high redshift with the same 3.4' x 3.4' 
field-of-view considered in the previous section. We explore 
what the information along the radial direction can or can- 
not tell us about the nature of high-redshift galaxies given 
the narrow field-of-view. We also examine whether or not it 
can be used to distinguish between luminous galaxies hosted 
by different mass halos, and ultimately to probe reioniza- 
tion. We are considering the "galaxy" power spectrum in 
the sense that we apply a duty cycle for LAEs and translate 
halo masses into hosted Lyman-a luminosities, but our re- 
sults do not include the effects of reionization on the power 
spectrum that we expect to observe in real surveys (e.g. 
Babich fc Loebll2006l : iMcQuinn et al.ll2007l : IWvithe fe Loebl 
20071 ). 



When analyzing the ID power spectrum, we do not at- 
tempt to produce mock surveys viewed along the light-cone 
since we wish to study the bias of the halos hosting high- 
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Figure 6. Same as Figure [5] except at z = 10. The minimum 
Lyman-a luminosity of Ll q m j n = 10 40 (10 41 ) ergs/s corresponds 
to A/ halo = 8.2 x 10 8 (3.2 x 10 9 ) M & at this redshift. 



redshift galaxies and compare our results to analytic, linear 
theory. Rather, we consider snapshots at z — 6 and z = 10. 
For this purpose, it is preferrable that the evolution of the 
halo mass and correlation fun ctions will not introd uce fea- 
tures in the power spectrum (|Mufioz fc Loebl 120081 ). Addi- 
tionally, stitching together simulation slices to produce a 
view along the light-cone would unnecessarily add or sub- 
tract (depending on the prescription used) power from the 
spectrum on scales larger than the size of the simulation box 
and on the scale of the separation between simulation slices. 
As shown in Figures [S] and [5] the 143 Mpc comoving size of 
the simulation box is sufficiently large to reproduce the lin- 
ear results on the largest scales. We plot our results only for 
values of k larger than 0.1 Mpc -1 to avoid confusion from 
power loss due to the finite box size. 

We calculate the spectrum in terms of real-space wave- 
numbers, as opposed to ones in redshift-space, and ignore 
cont amination of distanc e measurements by peculiar veloci- 
ties (|Monaco et al.|[2005l ) since they are expected to be small 
in the early universe. The redshifts of LAEs can be con- 
strained to within a few hundred km/s, corresponding to a 
wave-number of k ~ SMpc" 1 , while the distances to high- 
redsh ift LBGs can be uncertain t o hundreds of comoving 
Mpc (|Bouwens fc Illingworthll2006l ). 

We compare the power spectrum calculated from the 
simulation using a ID Fast Fourier Transform (FFT) to that 
determined analytically from linear perturbation theory. 
The theoretical ID power spectrum is obtained by integrat- 
ing the 3-dimensional power spectrum o ver a window func- 
tion corresponding to the field-of-view l|Kaiser fc Peacockl 
Il99ll) : 

Pm(k w ,M) = (6e//( 5^ (z))2 J P{k)W 2 xy dk x dk y , (5) 
where M is the minimum halo mass of the LBGs under con- 
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sideration, k — \Jk% + k y + k%, , W xy = W (k x ) W(k v ), and 
W(ki) is the same as in equation [3] The power measured 
in the simulation, however, includes an additional, scale- 
independent component from Poisson noise. This, too, can 
be calculated analytically as 

Ad, Poisson (Af) = (n(> M) a x a y a w Q{a w )y 1 (6) 

and subtracted off, where n(> Af) is the ST halo mass func- 
tion of LAE hosts with mass greater than M and 

Q(a w ) = — / W 2 (k w )dk w . (7) 

Here, a z can be any length scale and represents the depth of 
the skewer for which a 2 , = (TV (> A/)) — n(> M) a x a y a w 
is the Poisson variance in the number of LAEs. Since, 




W 2 {k w )dk w ~ 2/a 



Q(a w ) oc I/aw, and so Pi d, Poisson (Af) is approximately in- 
dependent of a w , as expected. From equation O we can see 
that the ID power spectrum at a given scale is a complicated 
convolution of shorter wavelength modes with the window 
function. This integration washes out any features that may 
have existed at small scales. Thus, we expect the ID spec- 
trum to be relatively featureless and smooth. 

We analyze the power spectrum for as many skewers 
through our simulation box that can be packed into the 
143 Mpc size, 17 and 15 for z — 6 and z = 10, respectively, 
before rotating the box to a new orientation and collecting 
additional sets of skewers. As in 33 this scheme is equiva- 
lent to choosing an infinite number of skewers each placed 
randomly on the face of the simulation box. The argument 
also applies to additional skewers gained through rotation 
of the simulation volume. 

Figures [5] and [6] compare analytic and simulation cal- 
culations of the ID power spectrum and bias after cosmic 
reionization is assumed to be completed (z — 6) and in its 
early stages at z = 10, respectively. In both figures, the 
black (blue) lines represent values for a minimum Lyman- 
a luminosity of Li a>In jn = 10 40 (10 41 ) ergs/s correspond- 
ing to M ha io = 1.6 x 10 9 (6.4 x lO 9 )Af at z = 6 and 
M ha i = 8.2 x 10 8 (3.2 x 10 9 ) M at z = 10. While LAEs 
are currently detected to approximately this luminosity at 
z = 6, future surveys should be able to probe down to the 
same luminosity at z = 10. Solid lines show the average log 
of the Poisson subtracted power spectrum (represented as 
A? D (ft) = kP 10 (k)/n) measured from 3 x 17 2 and 3 x 15 2 
skewers for z — 6 and z = 10, respectively. The factor of 
3 comes from additional independent skewers we obtained 
from rotating the simulation box. 

Long-dashed lines indicate the estimated 1 — a varia- 
tion in the log of the amplitude of the power spectrum from 
skewer-to-skewer. For clarity, these lines are only shown for 
f/La.miii = 10 40 ergs/s, but they are about the same size for 
f/La.miii = 10 41 ergs/s. An estimate of the power from a sin- 
gle skewer will fall within the 1 — a bounds 68 percent of 
the time. The standard deviation of the log of the power 
is approximately the same for all redshift and luminosity 
threshold combinations we considered with a w 0.4; it is 
also approximately independent of scale. The precision to 
which one is able to recover the true mean power spectrum 
is given by the standard error: er e = o /V N s kewers- Assuming 



that our estimate of the standard deviation is close to the 
true value and given our number of independent skewers, we 
were able to estimate the mean of the log of the power to 
within a? « 0.02. 

The level of error in measuring the power informs 
whether the ID power contains sufficient information to dis- 
tinguish between galaxies of different masses and luminosi- 
ties. This differentiability is important in constraining the 
mass-luminosity relationship and happens when the stan- 
dard errors for each power spectrum using iV s keweis inde- 
pendent fields-of-view (in this case all having dimensions 
of 3.4' x 3.4') are small enough that they don't overlap: 
\ftei — /U4o| = (o"4i + C4o)/V TVskcwcrs, where fi x and a x are 
the mean and standard deviations of the log of the power 
for the luminosity thresholds corresponding to LL Q ,mm = 
lO^ergs/s. On the largest scales probed, the mean of the log 
of the power differs by w 0.23 for both redshifts (with a 
weak scale dependence) and 0-41 w 0-40 w 0.4. Thus, approx- 
imately 12 independent skewers are needed to distinguish 
between the two luminosity thresholds based on their ID 
power spectrum. 

The short-dashed green (red) line shows the analytic 
power-spectrum according to linear perturbation theory, 
with the same luminosity threshold as the black (blue) line. 
With no fit other than the calculations described above, the 
analytic predictions match the data from the simulations 
quite well on the largest scales. As soon as nonlinearities 
dominate on small scales, the simulation results and the an- 
alytic calculations diverge in their predictions. However, it 
is important to note that the nonlinearities at work are on 
even smaller scales than the deviation scale due to aliasing. 

In the bottom panels of Figures [S] and [S] we show the 
halo bias, 

i=\fej^t, (8) 

measured from the simulation (solid lines) and compare the 
results to the analytic ST calculation (short-dashed lines). 
Here, Pyj <^ ic is the linear, analytic calculation of the ID 
dark matter power from equation ([5| with b e g(M) = 1. 
These plots show the same trend as in the upper panels 
with increasing divergence between simulation and analytic 
calculations on small scales. This is precisely t he result ob- 
tained from the full three-dimensional power bv lTrac fc Cer] 
(2007). However, note that the linear and nonlinear values 
do not match up exactly for LL a ,min = 10 41 ergs/s at z — 6. 
The ST mass function is not a perfect fit over all masses and 
so it results in a slight deviation in the derived bias. This 
effect also manifests itself in the upper panel as a small de- 
viation on large scales between the simulation and analytic 
power for the same luminosity and redshift. 

A striking aspect of the ID power spectrum illustrated 
in the top panels is that, for the 3.4' x 3.4' field-of-view, it re- 
mains relatively flat and featureless over a range of fc-values 
spanning an order-of-magnitude or more. This smoothness 
is expected since both the aliasing of small-scale power and 
the averaging of many skewers wash out small features in 
the spectrum. However, the flatness is unique to our choice 
of field-of-view. 

In Figure [7] we investigate how observational results 
from different fields-of-view can been compared by plotting 
the ID power simulated in 3.4' x 3.4' pencil-beam skewers 
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Figure 7. A comparison of the ID power spectrum of halos host- 
ing LAEs, as in Fig. [6] in surveys with different fields-of-view at 
z = 10. We set Llc* min = 10 40 ergs/s for each survey. Thick lines 
show the simulated average log of the Poisson-subtracted, dimen- 
sionless power spectrum, while thin ones represent calculations 
from linear theory. Solid, black lines correspond to a 3.4' X 3.4' 
field-of-view, while the short-dashed, blue and long-dashed, red 
lines denote 1' X 1' and 10' X 10' fields-of-view, respectively. The 
field-to- field variance from Figures \5\ and [6] have been omitted for 
clarity. For reference, we have plotted the linear theory spectrum 
for the dark matter assuming a pure ID survey with an infinites- 
imal field-of-view as the dotted-dashed, magenta curve. 



alongside those using 1' x 1' and 10' x 10' fields-of-view. The 
behavior in the interesting range of modes differs among the 
three cases because of the different modes allowed in the 
integration by the window function, which mixes up modes 
in a way that cannot be easily de-convolved (see Eq. [5|. 

Comparing the the linear theory predictions, we see that 
the position of the turn-over is directly related to the size of 
the window function with the narrower field-of-view turning 
over at larger k. Although the simulated position of the turn- 
over still increases monotonically with field-of-view size, the 
aliasing of nonlinear small-scale power changes the window 
function convolution in such a way that the position of the 
peak is not easily determined from the linear case. 

It is also clear that the flatness we noted in the simu- 
lated spectrum for the 3.4' x 3.4' field-of-view is not generic; 
the spectrum for the narrowest field-of-view first rises with 
k and then decreases. The 10' x 10' field-of-view produces a 
spectrum that is flatter than the one produced by the 1' x V 
field-of-view, but since the peak is also at much larger scales, 
we end up probing the spectrum in the regime where it be- 
gins to fall off at small scales. The extra bumps and wiggles 
in the curve result from fewer skewers of that size being 
available for averaging. 

For reference, we have also shown in Figure [7] the ID 
dark matter power spectrum in an idealized ID survey with 
an infinitesimal field-of-view. The aliased power in this case 
increases the amplitude of the spectrum even more than the 



bias factor in the LAE spectra using realistic fields-of-view. 
The figure shows the ID power approaching this limit as the 
field-of-view for realistic surveys decreases. 



5 CONCLUSIONS 

In this work we have analyzed the statistics of galaxies 
in mock "pencil beam" surveys of high-redshift LBGs and 
LAEs with narrow fields-of-view produced using N-body 
simulations. We computed the effect of sample variance and 
nonlinear bias and showed how these will influence measure- 
ments of galaxy counts in existing and upcoming surveys. 

While the variance due to fluctuations in the local over- 
density in LBG surveys increases almost monotonically with 
increasing luminosity (or halo mass), it dominates only at 
low luminosities and is overtaken at high luminosities by a 
Poisson variance that rises more steeply as the average num- 
ber of galaxies in the survey drops. However, the overdensi- 
ties become nonlinear on small scales. A given point is more 
likely to be in underdense regions than in overdense regions 
because of their larger volume filling fraction. The resulting 
skewness of the probability distribution of galaxy counts cre- 
ates slightly more uncertainty in obtaining the true number 
density of galaxies at the observed redshift than would oth- 
erwise be expected. This can be seen graphically in Fig. [4] 
where the simulation histogram is most closely matched by 
a skewed, log-normal distribution whose peak is centered 
on negative values of the overdensity. The skewness of the 
distribution and the resulting increased probability of sur- 
veying an underdense region should be taken into account 
when interpreting galaxy counts in the context of the ion- 
izing UV background they produce at high redshift, or else 
it would result in an underestimate of the background. Pre- 
cise measurements of the cosmic variance in a survey is also 
important for understanding the differences between two in- 
dependent samples. Our results and methods can be used 
to calibrate these errors for different fields-of-view and sur- 
vey depths appropriate to a given study instead of relying 
only on linear theory estimates at a single redshift as in 
iBouwens et"aH (|2008h . 

Whether or not future surveys of LAEs will have the 
angular resolution to compute angular correlation functions 
in the plane of the sky, they will probe many more modes 
along the line-of-sight. We expect the ID power spectrum 
from surveys with a JWST-sized field-of-view to exhibit a 
roughly flat, smooth profile. Aliasing and the averaging of 
many skewers are expected to result in a relatively feature- 
le ss spectrum unable to p robe void structure like that seen 
in lBroadhurst et al.[ (1990). The flatness, on the other hand, 
is unique to the particular choice of field-of-view, and com- 
paring JWST results, such as measurements of the galaxy 
bias, with those from other surveys will not be trivial due 
to the complicated convolution with the window function. 
Nonetheless, if the survey parameters are known, the ampli- 
tude on the largest scales matches well with the expectation 
from linear theory. However, on small scales, the nonlinear 
bias becomes large. Given their limited fields-of-view, a sur- 
vey with JWST will need tens of pointings to be able to dis- 
tinguish between host halo masses different by an order-of- 
magnitude limiting its usefulness in determining their mass- 
to-light ratios. Finally, our calculations ignored the complex 
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effects of reionization on the amplitude of the power spec- 
trum. The mass-to-light ratio for LAEs at redshifts during 
reionization could first be calibrated by extrapolating de- 
terminations made after reionization, through either clus- 
tering or abundance matching. With this information in 
hand, differences between our results and observations will 
show the effect of Lyman-a transmission through the HII 
regions around ionizing sources. However, probing reioniza- 
tion through the chan ging amplitude of the ID spectrum 
|McQ umn et al.ll2007h will require particular consideration 
of the field-of-view. 
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